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Abstract 

We propose a novel technique, termed compact shape trees, for 
computing correspondences of single-boundary 2-D shapes in 0(n 2 ) time. 
Together with zero or more features defined at each of n sample points on 
the shape’s boundary, the compact shape tree of a shape comprises the 
0(n) collection of vectors emanating from any of the sample points on 
the shape’s boundary to the rest of the sample points on the boundary. 
As it turns out, compact shape trees have a number of elegant properties 
both in the spatial and frequency domains. In particular, via a simple 
vector-algebraic argument, we show that the O(n) collection of vectors in 
a compact shape tree possesses at least the same discriminatory power as 
the 0(n 2 ) collection of lines emanating from each sample point to every 
other sample point on a shape’s boundary. In addition, we describe neat 
approaches for achieving scale and rotation invariance with compact shape 
trees in the spatial domain; by viewing compact shape trees as aperiodic 
discrete signals, we also prove scale and rotation invariance properties for 
them in the Fourier domain. Towards these, along the way, using concepts 
from differential geometry and the Calculus, we propose a novel theory for 
sampling 2-D shape boundaries in a scale and rotation invariant manner. 
Finally, we propose a number of shape recognition experiments to test the 
efficacy of our concept. 


1 Introduction 

Shape recognition is a consequential field of computer vision which studies, 
and seeks to automate, the processes by which previously known shapes can 
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be “known again.” Indeed, its importance is well underscored by how dated 
its mention occurs in the literature. For instance, Smith(2001) transmits the 
following implicit reference to it from Kitab al-Mannazir, an eleventh century 
book written by Ibn Al-Haytham, a prominent scholar of the Islamic Golden 
Age: 

Sight also perceives many things by means of recognition, so it rec¬ 
ognizes that a human is a human, that a horse is a horse, ... when 
it has seen the same thing before. 

Till date, research into shape recognition continues to receive significant atten¬ 
tion from computer scientists, cognitive psychologists, neuroscientists, as well 
as mathematicians. One famous method that was proposed quite recently —at 
the turn of the last decade —is shape contexts (Belongie, 2002). Shape contexts 
use a two-dimensional histogram of point counts to establish the correspondence 
between two shapes, and to compute a similarity score between the shapes. This 
concept paper proposes a shape descriptor that is very similar in spirit to shape 
contexts. As a tool for shape recognition, we propose the compact shape 
tree. Together with zero or more features defined at each of n sample points 
on the shape’s boundary, the compact shape tree of a shape comprises the 0{n) 
collection of vectors emanating from any of the sample points on the shape’s 
boundary to the rest of the sample points on the boundary. Indeed, Belongie 
et. al. already contemplated the use of the collection of the lines ( not vectors) 
emanating from any of the sample points on the shape’s boundary to the rest of 
the sample points on the boundary as an means for computing shape correspon¬ 
dences, towards recognition. However, they jettisoned it on grounds that it is 
likely to be unstable to intra-class variations. Albeit, the recent work of Wang 
et. al., who employed a very close variant of compact shape trees with suc¬ 
cess on well known shape databases, indicates that compact shape trees should 
be viable for shape recognition. Yet, there is a major difference between the 
approach taken here and those of Belongie et. al., and Wang et. al.. To com¬ 
pute the correspondences between two shapes, the approaches of those authors 
must ’’walk through” every sample point on the second shape for each sample 
point on the first, thereby resulting in an 0(n 3 ) time algorithm, where n is the 
number of sample points on each shape. To the contrary, our method walks 
through every sample point on the second shape for only one indeed any 
one —sample point on the first. Our algorithm begins by choosing any com¬ 
pact shape tree on the first shape. It then walks through all the sample points 
on the second shape, comparing the compact shape tree rooted at each point 
with the chosen compact shape tree of the first shape. The best matching com¬ 
pact shape tree on the second shape is then used to establish correspondences 
between the sample points of the two shapes. This results in an 0(n 2 ) time 
algorithm. It turns out that compact shape trees have a number of quite inter¬ 
esting properties. First, although the vectors in a compact shape tree form an 
0(n ) collection, they possess at least the same discriminatory power as shape 
forests, our terminology for the 0(n 2 ) collection of lines emanating from each 
sample point to every other sample point on a shape’s boundary. This is because 


2 



the length and angles of each line on the shape forest can be computed from 
the vectors comprising the compact shape tree. In fact, it should be possible 
to show that the compact shape tree is both a minimal and minimum ’’sub¬ 
set” of a shape forest with this property. More so, compact shape trees yield 
naturally to simple translation, scale, and rotation invariance techniques both 
in the frequency and spatial domains. In both domains, translation invariance 
is an intrinsic property of compact shape trees. In the frequency domain, we 
view the moduli and angles associated with the vectors comprising a compact 
shape tree as constituting aperiodic discrete signals. We then show that the 
scaled Fourier transforms of these signals is invariant to rotation and scaling. 
In the spatial domain, following Al-Ajlan, we achieve scale invariance simply by 
scaling the modulus of each vector in the compact shape tree by the maximum 
of the moduli. To achieve rotation invariance, we take the difference between 
the angles of consecutive —traversing the underlying shape’s boundary in a 
counter-clockwise manner —vectors in the compact shape tree. . 


2 Compact Shape Trees for Computing Corre¬ 
spondences 

We start by looking at compact shape trees consisting of vectors alone. Figures 
2a and 2b depict two shapes, each with eight sample points. Respectively, 
the sample points are pi,P 2 , ■■■,P8 and qi, < 72 , •••, qs', in practice, hundred sample 
points are typically used for better accuracy, so that the scenarios in Figures 2a 
and 2b are merely illustrative. Undoubtedly, the two shapes in the figures look 
very similar. If compact shape trees are anything to count on, then they must be 
able to capture the similarity between the two shapes. In Figure 2c, we exhibit, 
for the shape in Figure 2a, a compact shape tree rooted at point On the 
other hand, in Figure 2d, we depict for the shape in Figure 2b a compact shape 
tree rooted at point qs . It is obvious that the compact shape trees in Figures 2c 
and 2 d look very similar; they capture the similarity between the two underlying 
shapes. Indeed, if one were to consider any other compact shape tree on the 
shape in Figure 2b, one would find that none matches the compact shape tree of 
Figure 2c the way the compact shape tree of Figure 2d does. For instance, if we 
consider the compact shape tree rooted at 94 , shown in Figure 2e, we see clearly 
that it is poles apart from the structure in Figure 2c. Thus, it is only intuitive to 
conclude that the best matching compact shape tree, obtainable from the shape 
of Figure 2b, for the compact shape tree of Figure 2c is that in Figure 2d. From 
these pair of compact shape trees, one can easily obtain the correspondences: 
pi O ( 71 , P 2 O q 2 ,---P 8 qs, where ” 0 ” stands for ’’corresponds to.” We obtain 
ps 0 qs from the root of the trees, and obtain the remaining correspondences 
from the ’’tips” of the vectors. Let us now see what happens, if we consider any 
other compact shape tree on Figure 2a. Suppose we consider the compact shape 
tree rooted at P 4 (shown in Figure 2e). We should find that the best match to the 
tree of Figure 2e, from the compact shape trees of Figure 2b, is the one rooted at 
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( 74 - We show this compact shape tree in Figure 2f. Now, when we try to retrieve 
the correspondences between the compact shape trees of Figures 2e and 2f, we 
get: pi O Qi , P 2 o ( 72 , ■••P8 ^ Q 8 , which is exactly what we obtained when we 
used the tree rooted at p$ on the first tree. Based on this observation, we intuit 
that the choice of compact shape tree on the first shape should be immaterial; 
this is the most salient contribution of this paper, and this is what allows our 
method achieve its 0(n 2 ) run time property. Following Kimia et.al., Belongie et. 
al. and Wang et. ah, we are going to parlay tangency/curvature information 
into our model. We believe that tangency/curvature is a key visual feature 
that should be taken into account by any scheme aimed at the computation of 
correspondences. In what follows, we describe, at a rather formal level, how 
to match compact shape trees consisting of the vectors we have seen so far, 
along with curvature vectors and their moduli, measured at boundary sample 
points. Let Cp and Cq be twice differentiable simple closed curves defining 
the boundaries of two arbitrary shapes in the plane. In a counter-clockwise 
fashion, we mark equally-spaced sample points on each of the curves, obtaining 
for C-p, the set V = Pi,P 2 , ■■■■,Pm and for Cq the set Q = qi, < 72 , q n , where 
n is typically taken as 100. Next, we introduce C Pi = p t , p l , p 2 , ...,p n ~ l , which 
we call a rooted labeling of V with respect to p,. C Pi defines the order 
in which sample points are encountered on Cp, when traversed in a counter¬ 
clockwise manner, starting at Pi. We denote the compact shape tree rooted 
at pi by T Pi . Evidently,7"/, comprises n — 1 vectors. We label these vectors, 
Ui,U 2 , such that: the ’’tip” of iq is the sample point p 1 . Analogously, 

we may speak of tree T qi with n — 1 vectors: Vi,V 2 , Further, for each 

i € l,...,n— 1, the curvature at: p l is denoted Sp, the curvature at p t , the 
root of T Pi , is denoted S n . Similarly, the curvature at each q 1 is denoted Ki, 
while that at q t is denoted K n . With these, we can now define a tentative cost, 
C(T Pi ,Tqi), of matching T Pi and T q , as: 

n— 1 n 

C{T Pi ,T qi ) = w t ^ |m - Vi\ 2 + w 2 '^>2(6i - Ki) 2 ( 1 ) 

In the above equation, W\ and 11)2 are weights, which we leave up to training 
data to set for us. The first term of the equation measures a weighted sum of 
the square magnitudes of the differences between corresponding vectors in T Pi 
and T qi , while the second term measures a weighted sum of square Euclidean 
distances between curvatures. In preliminary experiments, we have noticed a 
problem with the discriminatory power of the second term of the above equation; 
we describe one way of fixing the problem in the next section. For now, we 
describe how to use the above equation. Of all compact shape trees from Q, 
the one that best matches the tree rooted at p t , is denoted T qi where <7 is the 
minimizer of C (' T Pi ,T qi )'. 


q = argrnin C (T Pi , T qi ) 

qi£Q 


( 2 ) 
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To retrieve the point correspondences being sought, we denote by q, q 1 , q 2 ..., q n ~ 1 
the rooted labeling of Q with respect to q. We imagine simultaneously travers¬ 
ing curves C-p and Cq, starting at pi and q respectively. The other in which we 
encounter sample points on both curves establishes the correspondences being 
sought. As depicted in Figure 2, we find: q O pi at the roots of Tq and T Pi ; and 
q l O p 1 , q 2 O p 2 , ...,g ™ -1 -O- p " _1 at the tips of the vectors in Tq and T Pi - 


3 A Problem With Distance Sums in Euclidean 
Space: the Curvature Case 


Herein, we describe the results of preliminary experiments that we performed 
on the curvature functions of ellipses. Consider Figure ELLIP. There are two 
half-ellipses in the figure, and the two of them can be parametrized with respect 
to angle 9: x = a cos 9 and y = bsin9. The first half-ellipse has parameters 
a = 3 and b = 7, while the second ellipse has a = 7 and b = 3. We now ask: 
which half-ellipse is most similar in “shape” to the straight line shown in the 
figure? Many human observers will definitely not say that both ellipses have 
the same level of similarity to the straight line. To our utmost dismay however, 
under zero-space sampling for continuous curves, the second term of Equation 
1 says they are. Although, we don not normally use zero-space sampling in 
practice, and more so the curves we work with in practice are not continuous, 
towards drawing lessons from it, it is nonetheless worthwhile to examine this 
problem in some depth. When we looked closely at the problem, we found it 
is does not stem from the choice of curvature as a feature used in Equation 1. 
Rather, it is a consequence of the phenomenon of distinct yet equidistant points 
in Euclidean space. Put clearly, the reason why the second term of Equation 
1 says that both half-ellipses have the same level of similarity to the straight 
line is that both half-ellipses are equidistant to the straight line in Euclidean 
curvature space. It turns out this issue is more mundane than it might at first 
seem, because it can be traced to the commutativity and associativity of the 
elementary addition operation: 2 + 3 + 10 = 10 + 2 + 3. It is appropriate that 
we describe our findings at a technical level. To begin with, in Mokhtarian and 
Mackworth (1992), the curvature function n{t) of a curve, C{t) = ( x(t ), y(t)), 
parametrized by variable f, is shown to be given by: 


x t (t)ytt{t) - y t (t)x tt (t) 
{x t (t) 2 + y t {t) 2 )i 


(3) 


As earlier hinted at, we parametrized the half-ellipses by angle 9: x(9 ) = a cos 9 
and y(9) = &sind, so that xg(9) = — asin0, xgg{9) = — acos9, ye(9) = bcos9, 
and yee(9) = —bsm9. Combining these, and allowing 9 to play the role of t in 
Equation 3 above, we get: 


k(9) 


ab sin 2 9 + ab cos 2 9 
(a 2 sin 2 9 + b 2 cos 2 9)i 


(4) 
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( 5 ) 


Using the identity, sin 2 9 + cos 2 9 = 1, we find: 


(a 2 sin 2 9 + b 2 cos 2 9) 2 

Advancing, we denote the curvature function of the straight line shown in Figure 
ELLIP by Ko, and Ko is everywhere zero, the line being straight. Further, we 
denote the curvature functions of the first and second half-ellipses by Ki(0) 
and «2 (9) respectively. Keeping in mind that a = 3 and b = 7 for the first 
half-ellipse, then according to the second term of Equation 1, the “difference” 
between the first half-ellipse and the straight line is given by: 


D! 


W2 


r < _hi_ 

J ' (3 2 sin 2 9 + 7 2 cos 2 9) i 


0 fd9 


( 6 ) 


In the same vein, the difference, as measured by the second term of Equation 
1 , between the second ellipse and the straight line is: 


D2 = W2 


r < _21_ 

J^l (7 2 sin 2 0 + 3 2 cos 2 9) § 


0 fd9 


(7) 


When we performed the integrations numerically using MATLAB function in¬ 
tegral, we found D\ = £> 2 , which implies W 2 D 1 = W 2 -D 2 . In fact, we found that 
this is always the case for any pair of half-ellipses, such that the first one has 
parameters a = ai, b = bi, and the second one has parameters a = b±,b = ai. 
In Figure ELLIPPLOTa, we plot a graph of D 2 — D\ + 5 against 9 for the 
half-ellipses of Figure ELLIP. As can be seen, the graph is everywhere equal 
to 5, an indication that D 1 = D 2 , and by extension W 2 D 1 = W 2 -D 2 . In Figure 
ELLIPPLOTb, we plot a similar graph for another pair of half-ellipses, the first 
having a = 17, b = 69, and the second having a = 69, b = 17. Again, we find 
that the graph stays at 5 through out. The key issue is how to deal with the 
problem described above. Our investigations revealed that one possible solution 
is to employ the natural logarithm of the first order moments of the squared 
distances between the curvature functions. We denote this M, and express it 
as: ^ 

M = log e ( f t [ K p (t) — K q (t) ] 2 dt) ( 8 ) 

Jt 1 

where n p (t) is the curvature function on the boundary of the first shape and 
K q {t) is the curvature function on the boundary of the second shape. We applied 
the above Equation to compare the two half-ellipses of Figure ELLIP with the 
straight line, and found that it is able to “discriminate” between them. In 
particular, for the first ellipse we find M =, while for the second ellipse we 
found M =. Furthermore, we observed that the quantity M increases as the 
“protrusion” of the half-ellipses increase. For example, M is greater for the 
second half-ellipse of Figure ELLIP than it is for the first, because the second 
half-ellipse is more protruded than the first. We present data to corroborate 
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this observation in Table 1. The above observations suggest that we replace the 
matching cost, C(T Pi ,T qi ), defined in Equation 1 , with a new cost, C(T Pi ,T qi ), 
defined as follows: 

n— 1 n n 

C('T Pi ,T qi ) = wi ^2 \ u i ~ v i\ 2 + w 2 y- m) 2 + log e (w 3 i(6i - m) 2 ) ( 9 ) 

i =1 i=l i=l 

The first two terms of the above equation are exactly the same as the two terms 
of Equation 1. The third term of the equation can be considered a discrete 
version of the right hand side of Equation 8 . Notice that, in the third term, 
the index i plays the role parameter t plays in the continuous version given in 
Equation 8 ; it is acceptable to parametrize discrete curves by indices. 


4 One Theory and One Conjecture about Com¬ 
pact Shape Trees 

In the previous section, we stated that when comparing two shapes, only one 
compact shape tree on the first shape needs to compared with the compact 
shape trees of the second shape. Is there any formal theoretical guarantee that 
there is no match leading to a different set of correspondences amongst the n — 1 
“uncompared” compact shape trees on the first shape? As suggested by Figure 
1, we believe there is none. However, because we do not yet have a proof at this 
time, we state this observation as a formal conjecture: 

Conjecture 1. Let V and Q respectively denote sets of n sample points on two 
shapes in the plane. For a given compact shape tree, T Pi = {pi,p x ,p 2 ,...,p n ~ x } C 
V, suppose, amongst all the compact shape trees obtainable from Q, the one that 
best matches T Pi is the tree T q = {q, q 1 , q 2 ,..., q n ^ 1 } C Q. Further suppose, for 
some other compact shape tree, T r = {r, r 1 , r 2 ,..., r™ -1 } C V,r ^ pi, amongst 
all the compact shape trees obtainable from Q, the one that best matches T r is 
the tree Ts = {s, s 1 , s 2 ,..., s™ -1 } C Q. Then, the correspondences, pi O ^p 1 O 
q r ,p 2 O q 2 , ...,p ™ -1 O q n ~ x , retrieved from matching T Pi andT q are exactly the 
same as (i.e. just a permutation of) the correspondences, r O s, r 1 O s 1 , r 2 O 
s , ...,r™ _1 O s” -1 , retrieved from matching T r and Ts- 

Also, in the Introduction section, we asserted that the vectors in a compact 
shape tree have at least the same discriminatory power as a shape forest, which 
is the 0 (n 2 ) collection of lines emanating from each sample point on a shape’s 
boundary to every other sample point on the boundary. We are now going to 
prove this claim after casting it formally thus: 

Theorem 1 . Respectively, let Tp and T be a compact shape tree and a shape 
forest obtained from n sample points of a simple closed curve. Further, let Ik and 
j3k be the length and angle, with respect to the positive x-axis, of the k-th line 
in T. Each ( Ik, (3k ) pair can be obtained from the vectors in Tp, either directly 
or via calculation, so that all the information in the 0 (n 2 ) collection, T, is also 
available in the 0 (n) collection of vectors in Tp. 
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Proof. 

Consider any two distinct points pi and pj in T, and denote by Ik and 8k the 
length and angle, with respect to the positive £-axis, of the line joining pi and 
Pj in T . There are only three possibilities: 1). pi = p, so that pj ^ p 2). 
Pj = p, so that pi ^ p 3). pi ^ p, and pj ^ p as well. For the first possibility, 
the directed edge ( Pi,Pj) can be written as the directed edge (p,Pj), which 
obviously is a vector, v, in Tp . Thus, Ik and /3k can be obtained from the polar 
representation of v. The second possibility is analogous to the first one. For the 
third possibility, we note that pi and pj must be the tips of two distinct vectors, 
V\ and V2, in Tp . Now, we simply set V3 = V2 — Vi, which says that V3 is a vector 
extending from pi to pj. It is clear that Ik and 8k can then be obtained from 
the polar representation of V3. □ 

5 Theories and Algorithms for Scale and Rota¬ 
tion Invariant Sampling 

In the previous sections, we spoke severally of sampling n points of the boundary 
of a shape. What exactly is the best way of choosing these sample points ? 
Although, Ling et. ah, Wang et. ah, and Belongie et. al have used boundary 
sample points in their works, they do not explicitly describe any algorithms 
for computing them, except that they said that the sample points are equally 
spaced along the boundary of the shape. In particular, they do not mention 
whether or not their sample points have been computed in a scale and rotation 
invariant manner, whereas it would be optimal to employ scale and rotation 
invariant sample points for computing shape correspondences. In this section, 
we are going to describe three possible approaches to computing sample points. 
The first approach is an heuristic which comes quite close to being rotation and 
scale invariant, while the latter two are based on theories from basic differential 
geometry and Calculus. We shall prove rigorously that, under fair assumptions, 
these latter two techniques lead to exact scale and rotation invariant sampling. 
We want to define what we mean by scale and rotation invariant sampling. In 
R 2 , we allow the operator £ 7 (.) to denote the operation of enlarging a shape 
by a factor of 7 . Also, we use R$ to denote the 2 by 2 matrix that abstracts 
rotation of shapes by <j> degrees in the plane, and use to denote this 

operation. Hence, if shape S is the result of first enlarging another shape, S , by 
a factor of 7 , and then rotating this intermediate result by </>, one may write: 
S = R c / > (£ 7 (S) ). Now, where V = pi,P 2 ,... ,p n is a sampling of n points on the 
boundary of <S, and V = Pi,P 2 , ■ ■ ■ ,Pn is a sampling of n points on the boundary 
of S , then we say that the samplings are rotation and scale invariant if V and V 
satisfy the bijection / : V —> V, in which /(.) = R^( £ 7 (.) ). Put simply, if the 
samplings are rotation and scale in invariant, then each point pi G V is mapped 
by Rcf,( £ 7 (.) ) to a point pj G V , and the point pj is mapped back to the point 
Pi by R^ ( £±(-) ), Rfp 1 being the inverse of R^. 

From the above definition, it would appear that, after sampling S to obtain 



V, then one can obtain V simply by applying the operation i?^( £ 7 (.) ) on the 
elements of V . However, this can lead to a vicious cycle: many approaches 
obtain R c / > (£ 1 (.)) by solving the shape correspondence problem between S and 
S , but to solve this shape correspondence problem, we need both V, and V 
ready. As we shall see, our theory of invariant sampling leads to an algorithm 
that does not need any knowledge of R ( / ) ( £ 7 (.)) to work. 


5.1 A Pseudo-Invariant Heuristic for Shape Boundary Sam¬ 
pling 


We describe a pseudo-invariant sampling algorithm that works on an heuris¬ 
tic rule. The algorithm expects as inputs a closed curve C, which forms the 
boundary of some shape 5, as well as the number, n, of sample points to be 
computed along C. The condition n = 2 fc , for some non-negative integer k, must 
be met, because the "hidden” objective of the algorithm is to recursively divide 
C into two equal halves until a stopping criterion is met. Although, ideally, 
the algorithm should output equally-spaced sample points, it turns out this is 
not the case for digital curves, due to the effects of digitization. To illustrate 
the essence of the method, we take shape S to be a disk having a circumference 
of 24cm. Also, we take 7 = 2 and <fr = 90° (actually, we know rotation does 
not have any real effect on circles), so that S = Rgo{ & 2 {S) ) is simply the disk 
whose circumference is 48cm. When n = 8 , for disk S , the algorithm computes 
a set of eight sample points V = Pi,P 2 , ■ ■ ■ ,Ps, such that the arc length, s, be¬ 
tween each pair of consecutive sample points on the disk’s boundary is given by 


24 - 

s = — = 3cm. Likewise, with n = 8 , for the second disk, S, the algorithm ob¬ 


tains a set of eight sample points, V = Pi,p 2 , • • ■ ,P8, such that the arc length, s, 
between each pair of consecutive sample points on the disk’s boundary is given 
48 

by s = — = 6 cm. In Figure 2, we show both disks and the sample points 


computed for them by the algorithm. From the figure, it is easy to see that a 
bijection, /(.) = R*( £ 2 (-) ), readily exists between V and V. Notice that this 
says that the algorithm has been able to compute rotation and scale invariant 
samplings for the disks. More importantly, it should not be hard to see that the 
algorithm will always compute rotation and scale invariant for any non-digitized 
shape. 

Albeit, we have identified two drawbacks of the algorithm. First, the require¬ 
ment n = 2 k is a very harsh one. For instance, suppose we wish to sample 
100 points, then, we must settle for either 2 6 = 64 points, or 2' = 128 points. 
Clearly, neither 64 nor 128 is a good approximation to 100. Thus, the con¬ 
dition n = 2 k embodies a really stringent constraint. The second drawback 
concerns digitalized shapes. This problem stems from the fact that pixel counts 
are always integers, so when the algorithm needs to pick the middle point of an 
open digital curve having a length of 24 pixels it must either pick the twelfth 
or thirteenth pixel. But, definitely, both pixels are neither the exact middle of 
the open curve. Well, we believe this does not lead to terrible deviations from 
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the invariant behavior of the algorithm we saw in the case of continuous curves. 
Thus, the must significant drawback is the first one. The next two sub-sections 
are devoted to discussions of theories and algorithms which do not suffer this 
drawback. 

5.2 A Theory of Invariant Boundary Sampling by Dis¬ 
tance Minimization: Part I 

The style of this sub-section will be largely formal. But, before the formalisms, 
we will give an overview of the technique being proposed. At base, the proposed 
technique harnesses the observation that distances from a shape’s boundary 
points to the shape’s centroid are invariant to rotation, and are simply multi¬ 
plied by the scaling factor, when the shape is scaled and/or rotated. That is, if 
D(s) is function that measures these distances before scaling and/or rotation, 
and D(s) is function that measures them after scaling and/or rotation, then the 
functions satisfy the relationship D(s) = 'yD(s), where 7 is the scaling factor. 
Using techniques from Calculus, we shall study some theoretical and practical 
implications of the above relationship. Specifically, we will show that the min¬ 
ima/maxima of D(s) correspond to the minima/maxima of Z?(s), under the 
rule s = 7 s. Practically, this means, for example, that if si is the only absolute 
minimum point of D(s) then Si = 7 S 1 will be the only absolute minimum of 
D(s). In this scenario, we can harness si and .§1 as seed points in our boundary 
sampling algorithm; the rest of the sample points are calculated with respect 
to si and Si respectively. Furthermore, as we shall see, by solving a simple 
distance-correspondence problem around the boundaries of the shapes, we are 
able to extend our algorithm to cover the case wherein D(s) and D(s) each have 
more than one absolute minima/maxima. We can now begin the formalism: 

Definition 1 . Let S be any shape in M 2 , and let S be the shape that results 
when S is enlarged by a factor of 7 and then rotated through </> radians in 
the plane. We shall express this as S = 7 Z ( / ) ( £ 7 (<S) ), where £ 7 (.) denotes the 
enlargement operation, denotes the rotation operation, and R<f, is the 

orthogonal rotation matrix associated with </>. Also, for any p € S, and any 
p € S, ifIZ^{ £ 7 (.) ) sends p to p, then we shall write p = £ 7 (p) )• Finally, 

if the curves C, and C are respectively the boundaries of S and S , then we shall 
indicate this via the expression C = £-y{C )) 

Theorem 2. Suppose C and C are two twice-differentiable simple closed curves 
in R 2 , such that C = R,p( £ 7 (C) ). Further, on C, let si be the arc length, 
measured counter-clockwise from some arbitrary point pi to another arbitrary 
point pj; and, on C, let Si be the arc length measured counter-clockwise from 
some arbitrary point pi to another arbitrary point pj. The following hold: 1). 
If Pi = R(f,( £~/{pi) ) and pj = i?^( £ 7 (pj)), then we must have si = 7 S 1 . 2). if 
Pi = R<t>{ £-y(Pi) ) and si = 7 S 1 , then we must have pj = R^( £j(pj) )■ 
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Proof. 

In what follows, on C , arc length, s, is measured counter-clockwise from pp, while, 
on C, arc length, s, is measured counter-clockwise from p,;. Beginning with the 
first part of the theorem, assume = Ry,( £ 7 {pi) ) and pj = Ry,( £ 7 (pj) ))• 
Parameterizing by s, one may write C(s) = ( x(s), y(s) ) T (Here, we are delib¬ 
erately using column vectors to make certain matrices compatible for multi¬ 
plication). Further, with ( x(s), y(s) ) T representing the point which R t / > ( £ 7 (.) 
sends ( :r(s), y(s) ) T to, we have ( i(s), y(s) ) T = Rj, y( a:(s), y(s) ) T , and 
C(s) = (x(s), y(s)) T = Rj,j(x(s), y(s )) T . Now, the quantity si can be obtained 
from, si = J 0 Sl ds. To obtain si, we start with the relationship, ds = |T(s)|(is, 
in which T(s) denotes the tangent vector, as a function of s, on curve C(s). One 
can compute T(s) by differentiating C(s) with respect to s: T(s) = C s (s) = 

y( a; g (s), y s (s) ) T . To compute the magnitude of T(s), we first write the ro¬ 
tation matrix, R^, explicitly as: Rj, = ^ cog^ ■ This allows us to see: 

, from which 

1 

we obtain, |T(s)| = ((7 x s cos </> + 7 y s sin <j)) 2 + (—727 sin </> + 7 y s cos <f >) 2 ) 2 = 
( 7 2 2 ; 2 cos 2 0+ 27 2 2 : s cos </> sin <^>+ 7 2 y 2 sin 2 <)>+ 7 2 2 ; 2 sin 2 </>— 2 r y 2 x s y s cos </> sin 4 >+ 

1 

7 2 y 2 cos 2 (j) ) 2 . Using the identity, sin 2 (j) + cos 2 (j) = 1 , we ultimately find: 

1 1 ' 

|T(s)| = 7(a: s (s) 2 + y s (s ) 2 )2. But, the quantity (a: s (s) 2 + y s (s ) 2 )2 equals 
|C s (s)|. Moreover, since s is the arc length parameter of C(s), we must have 
|C s (s)| = 1. In symbols we write, | T(s)\ = 7|C s (s)| = 7. Now, if one plugs this 
into the relationship, ds = |T(s)|ds, one gets: ds = 'yds. Finally, integrating 
this from s = 0 to s = Si, we get §1 = J Q Sl 'yds = 7 S 1 . For the second part of the 
theorem, we prove by contradiction. Assume pi = Rj>( £^{pi) ) and §1 = 7S1, 
but pj 7 ^ Ripi £-y(Pj) )■ But, R(f,( £ 7 (.) must map pj to some point, say pk, on 
the curve C. Hence, there exists a point pk ff Pj , such that pk = Rj,{ £-y(pj) )• 
By the first part of this theorem, it means that the arc length of pk is .sq. But, 
this is a contradiction because the two distinct points pk and pj cannot have 
the same arc length. □ 

The above theorem paves the way to the following theoretical result: 

Theorem 3. In R 2 , suppose C and C are twice differentiable simple closed 
curves, such that C = i?q( £ 1 (C) ); and suppose point Pi € C and point pi £ C 
are related by pi = Rj>( £ 1 {pi )). Further, for any two points pj £ C and pj £ C 
related by pj = Rj > (£ 1 (pj) ), let s be the arc length of C, measured (counter) 
clockwise frompt, and let s be the arc length ofC, measured (counter) clockwise 
from . Then, where St is the total arc length of C, and where §t is the total 
arc length of C, if G : [0, St]/{0, s*} —> R and G : [0, St]/{0, St} —> R are two 
functions satisfying G(s) = cG(s ) for some positive constant, c, the following 
must hold. 1). si is an absolute minimum point ofG(s) if and only if R ( j > {£ 1 (si)) 


T(s) = . 0 

w V — sin 0 


cos f 


sind 

cos# 


£s(s) 

ys{s) 
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is an absolute minimum point of G(s). 2). si is an absolute maximum point of 
G(s) if and only if R t /,( £ 7 (si)) is an absolute maximum point ofG(s). 

Proof. 

Theorem 2 enforces s = 7 s. We can use this to map sub-intervals of I = 
[0, s*]/{0, s t } to sub-intervals of I = [0, s t ]/{0, s t }, and vice versa. For example, 
[S 01 S 2 ] Cl is mapped to [so,S 2 ] C I under the condition that sq = jsq and 
S 2 = 7 S 2 . With this in mind, we proceed. We recall from Calculus the definition 
of an absolute minimum of function G(s) on interval I. If si is an absolute 
minimum of G(s) on /, then G(si) < G(s) for all s £ I. Since c > 0, we have 
cG(si) < cG(s) => G(Si) < G(s) for all s € I. As discussed above, the interval 
/ maps to the interval /, since 0 = 7 O and §t = 7 St- Consequently, we may 
declare: G(si) < G(s) for all s € I, which says that sq is an absolute minimum 
of G(s) on I. This proves the forward implication of the ”if and only if’ phrase 
in the first conclusion of the theorem. For sheer completeness, we spell out 
a proof of the reverse implication. Assume sq to be an absolute minimum of 

G(s) on /, so that G(Si) < G(s) => -G(§i) < -G(s) => G(si) < G(s) on 

c c 

I => G(si) < G(s) on I, the last implication being a consequence of mapping 
I to I. The proof of the second conclusion of the theorem is analogous to the 
theorem’s first conclusion, so do not write it out. □ 


With the aid of the just proven theorem, we can now describe a preliminary 
version of our scale and rotation invariant sampling algorithm. As we have 
been doing, we suppose S and S are two shapes satisfying S = 7 Z t j > ( £ 7 (<S) ), 
and that C and C are the boundaries of S and S respectively. Further, on C, an 
arbitrary point pi is chosen from which arc lengths, are measured, say, counter¬ 
clockwise. Similarly, on C, the point pi satisfying p^ = £ 7 ( f) ) is chosen 

V 


for the same purpose. Arc lengths are denoted s on C, and denoted s on C. 
We now let D(s) denote the distance from the centroid of S to the point with 
arc length s on C. In the same vein, we let D{s) denote the distance from the 
centroid of S to the point with arc length s on C. As mentioned at the opening 
of this sub-section D(s) and D(s) are related by D(s) = 7 D(s). Therefore, 
by Theorem 4, the absolute minima/maxima of D(s) must correspond to the 
absolute minima/maxima of D(s), under the rule s — 7 s. The “preliminary 
version” of our algorithm is for the case wherein both D(s) and D(s) each have 
a either a single absolute minimum point or a single absolute maximum point. 
The algorithm outputs a set V = Pi,P2, ■■,Pn of n sample points when invoked 
with S and n; and outputs a set V = Pi,P2, ■ • • ,Pn of n when called with S and 
n. Now, suppose sq and A are respectively the “single points” at which D{s) 
and D(s) reach their absolute minima. Further, suppose C and C have total arc 
lengths St and s t respectively. Our algorithm simply takes Si and jd as seed 
points, and this means si and .§1 correspond to the first sample points computed 


for C and C. Next, the algorithm computes the quantities w = — and w = —. 

n n 

With these, for C, it is able to compute V = Pi,P2,-,Pm such that pi € R 2 
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corresponds to the seed point on C, and each p, G R 2 , i G 2,,.., n, is at an 
arc length distance of w from pi-i. Clearly, this leads to equal-spaced sampling 
of curve C; See Figure. Analogously, for C, the algorithm is able to compute 
V = pi,p 2 , ■ ■ ■ ,p n such that p\ G R 2 corresponds to the seed point on C, and 
each pi G R 2 , i G 2,..., n, is the point that is at an arc length distance of w from 
pi. Similar to the case of C, this leads to equal-spaced sampling of curve C. We 
look at the situation in some more details. First, Theorem 4, says that p\ and p\ 
must satisfy ~ p = £ 7 ( ). Further, Theorem 2 implies §t = 'yst, which 

OO V 

s s 

further implies w = — = 7 — = 7 w, so that each pi is at an arc length distance of 
n n 

7 (i — l)to fromp 1; while, as earlier noted, each Pi, i G 2,..., n, is at an arc length 
distance of (i — l)w from pi. This, together with the fact ~ p = 1Z$ (£ 7 ( 00 )) as 

OO V 

well as the second part of Theorem 2, now inform us that for each i G 2,..., n, 
the relationship ~p = TZ^ £ 7 ( a) ) is in force. Consequently, our ’’preliminary 
> . V 

algorithm,” achieves rotation and scale invariant sampling. Furthermore, it also 
obliterates the harsh n = 2 k requirement of the method described in the previous 
sub-section. On the flip side though, in its present form, the algorithm is able 
to deal only with continuous curves. However, it should not be very difficult to 
adapt it for digital curves. In particular, this will involve the use of techniques 
specialized for the estimation of arc length on digital curves. Fortunately, there 
exist several such techniques in the literature (). We outline our preliminary 
algorithm as Algorithm 1 below. 

5.3 A Theory of Invariant Boundary Sampling by Dis¬ 
tance Minimization: Part II 

In this sub-section, we attempt to extend our curve boundary sampling algo¬ 
rithm to cover cases wherein the distance functions, D(s) and D(s), do not 
have unique points of absolute minimum/maximum. Our approach will involve 
solving a simple arc length correspondence problem around the boundaries of 
the involved curves at recognition time. To this end, let A = {ai, < 22 ,..., a^}, 
L > 1, cii G mathbbR 2 , be the set of points at which D(s) reaches absolute min¬ 
imum, and let A = {ai,a 2 , ■ • • ,ul} be the set of points at which D(s) reaches 
absolute minimum. An implication of Theorem 4 is that there exists a bijection, 
/ from set A to set A. Our goal is to find this bijection. To illustrate this goal, 
consider Figure X, a depiction of two shapes S and 5, satisfying C = R ( j > {£ 1 {C)). 
A glance at the figure reveals that the bijection, /, we seek can be written in 
an ordered set T as: T = {01 PP 03,02 <rP 01,03 PP 02 }, where the “order” 
in T comes from the fact that the correspondences in it are written in counter 
clockwise fashion around the boundaries of S and S. By Theorem 2, there 
is a natural relationship between the arc lengths of consecutive members of 
T. For instance, if we consider the second and third members of J r , that is, 
02 <rP oi and 03 « 2 , and denote the arc length from point 02 to 03 by s, and 
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S 1 

that from point di to <22 is s, then Theorem 2 tells us that s = js => - = —. 

s 7 

Furthermore, with C being the boundary of S; St denoting the total arc length 
of C; C representing the boundary of S; and St standing for the total arc length 

of C , we also have s t = jst => — = —. Thus, we find - = — = So, we 

St 7 s 7_ s t 

S St s s s s 

see - = — implying — = —. Consequently, we have (-— ) 2 = 0. It is 

S St St St St 

this final “equality to zero” that forms the basis of our method. In particular, 
notice that since our choice of the second and third members of T was arbitrary, 
it follows that the “equality to zero” holds for all consecutive members of T. 
We formalize this observation below. Consider any a, £ A (set A is as defined 
above) and let us define the ordered set A ai = {o*, a 1 , a 2 ,..., a i_1 }, such that 
A ai captures the order in which points in set A are to be encountered on the 
curve C, when C is traversed in a counter clockwise manner starting out at at. 
With this, we can define an arc length set, B ai = {sj., s 2 .,..., }, in which 

Sq. , k £ { 1 , 2 ,..., L — 1 } is the arc length between (J !: and its predecessor in 
the ordered set A ai . Notice that this leaves s% undefined. We simply define 
s%. as the arc length from a L back to a*. Analogously, for curve C, we define 
A aj = {a,j, a 1 , a 2 ,..., a i ” 1 } and B aj = {s^., s„.,..., }. With these, we com¬ 

pute C(ai,dj), the arc length correspondence score between point a,: £ A C C 
and point dj £ A C C as follows: 


C(a l ,a i )=^(^-^i) 2 (10) 

, . S t S t 

k= 1 

From our earlier discussion, it should be clear that if the ordered sets A ai and 
Aa^ form a bijection, then C(a,i, dj) must equal zero. In this event, we can simply 
take points ai and dj as seed points for curves C C respectively. Having done so, 
the rest of the processing, towards obtaining scale and rotation invariant sample 
points, then becomes identical to Algorithm 1. With this, it is clear that we 
have been able to achieve a solution for the case wherein the functions D(s ) 
and D(s) each have more than one point of absolute minimum/maximum. Our 
solution is delineated in Algorithm 2 below. Again, while Algorithm 2 is able 
to efface the n = 2 k requirement of the method presented in Section 4.1, it will 
still need some adaptations to handle digital curves. 

5.4 Scale and Rotation Invariant Sampling By Critical 
Points of Shape Curvature 

In this section, we describe a technique for invariant shape boundary sampling 
using the local maxima of the curvature function of the shape. Indeed, Xie 
et. al. (2007) have already described this sort of sampling technique, and they 
mentioned that it generally does not lead to equal-spaced sampling, unlike the 
distance minimization technique described in the previous section. However, 
they did not mention whether it is scale and rotation invariant or not. We will 
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now prove that it indeed is scale and rotation invariant. Towards this, we prove 
two theorems. The first theorem captures the effect of scaling and rotation on 
curvature, while the second theorem underpins the effect of scaling and rotation 
on the critical points of functions, such as the curvature function, defined along 
shape boundaries: 

Theorem 4. Suppose S and S are two shapes in R, such that S = Rj,(£^(S)); 
and suppose C and C are twice differentiable simple closed curves delineating the 
boundaries of S and S respectively. Further, let s be the arc length of C, mea¬ 
sured counter-clockwise from some point pi on C; and let s be the arc length of 
C measured counter-clockwise from the point pi — R c / > ( T-yipt) ). Then, for any 
point pj on C, the curvature vector at the point R^( £-y(pj) ) is 7 divided by the 
curvature vector at the point p t . 


Proof. We set p, = S 7 (pi) and pj = Sj(pj). The arc length at Pj is denoted s 
and that at R ( / ) ( £ 7 {Pj) ) is denoted s. So, from Theorem 2, we know s = — s. 

1 7 

Differentiating this with respect to s, we get sg — — ■ Now one may write, 

7 

C(s) = C(s ) = R ! p( 7 x(s), 7 y(s) ) T . Differentiating with respect to s, we get: 
Cs[s) = Cs{s ) = R</>{ jx s (s)ss, 7 y s {s)ss ) T = R</>{ x s (s), y s {s) ) T , the last 
equality following from the fact that Sg = —. Differentiating the above result 

:h 

with respect to s, we now find: Css(s) = Csg(s) = R^,( x ss (s)sg, y S s(s)sg ) T = 

Rch( -Xss(s), -y ss (s ) ) T = i? 0 -( x ss (s), y ss (s) ) T . This proves the theorem, 
7 7 7 

since R ( / > ( x ss (s), y ss (s ) ) T is the curvature vector of C(s). In closing, we point 
out that the given proof makes implicit use of the fact that both |C s (s)| and 
|C S -(3)| are equal to unity. □ 

The second theorem now follows: 

Theorem 5. In R 2 , suppose C and C are twice differentiable simple closed 
curves, such that C = R^( £ 7 (C) ); and suppose point Pi € C and point pi £ C 
are related by pi = R c / > ( £ 7 {pi) ). Further, for any two points pj £ C and pj £ C 
related by pj = R ( j{£ 1 {pj) ), let s be the arc length of C, measured (counter) 
clockwise frompi, and let s be the arc length ofC, measured (counter) clockwise 
from pi. Then, where St is the total arc length of C, and where §t is the total 
arc length of C, if G : [0, s*]/{0, s*} —> R and G : [0, St]/{0, St} —> R are two 
functions satisfying G(s) = cG(s ) for some positive constant, c, the following 
must hold. 1). si is a minimum point of G(s) if and only if R c / > (£~ t (s 1 )) is a 
local minimum point of G(s). 2). Si is a maximum point of G(s) if and only if 
Rj,( £ 7 (s 1 )) is a local maximum point of G(s). 3). si is an inflection point of 
G(s ) if and only if R ( j{£ 1 (s\)) is an inflection point ofG(s). 
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Given below, the proof of the above theorem makes use of the fact that if si 
is a critical point of D(s), and D k (s i) is the first non-zero derivative of D(s) at 
Si, then si is a local minimum/maximum point of D(s) if k is even and D k (s i) 
is positive/negative. If k is odd, then si is an inflection point. 


Proof. 

Firstly, observe that Theorem 2 above tells us that s and s are related by 

s = 7 s => s = —s. Differentiating with respect to s, we have sg = —. Now, 
7 7 

differentiating G(s) = cG(s) with respect to s, we write: Gg(s) = cGg(s) = 

cG s (s)ss = —cG s (s). Summarily, we have found Gg(s) = —cG s (s). Since c and 


7 are non-zero, it follows that Gg(s) = 0 if and only if G s (s) = 0. This means, 
if we consider two points, si € I and si G /, such that Si = 7 s 1 , then si is a 
critical point of G(s) if and only if si is a critical point of G(s). To proceed, 

we differentiate Gg(s) = — cGAs ) with respect to s, Ggg(s) = —cG s g(s) = 

1 1 7 1 7 

—cG ss (s)sg = -^jcGss(s). So, we have found, Ggg(s) = ~^cG ss (s). Indeed, 

sy y^ y^ 

if one continues to differentiate, one will find: G k (s ) = —cG fc (s)sg, where 

G fc (s) is the fc-th derivative of G(s) with respect to s, and G k (s) is the fc-th 
derivative of G(s) with respect to s. Now, since c and 7 are positive, it follows 
that G fe (si) > 0 if and only if G fc (si) > 0. Similarly, G k (Si) < 0 if and only 
if G fc (si) < 0. Also, G k (s 1 ) = 0 if and only if G fc (si) = 0. Thus, G k (si) is 
the first positive/negative/zero derivative of G(s) at Si if and only if G k (s 1 ) is 
the first positive/negative/zero derivative of G(s) at si. In particular, if k is 
even and G fc (s~i) is the first positive derivative of G(s) at si, then si is a local 
minimum point of G(sJ and si is a local minimum point of G(s). On the flip 
side, if k is even and G fc (s~i) is the first negative derivative of G(s) at si, then 
Si is a local maximum point of G(s) and si is a local maximum point of G(s). 
Finally, if k is odd and G fe (s~i) is the first non-zero derivative of G(s) at si, then 
Si is an inflection point of G(s) and Si is an inflection point of G(s) □ 


We can now explore ways of exploiting the above theorems for the task at 
hand. First, notice from Theorem 4, that if k(s) is the curvature function on 
C(s) and k(s) is the curvature function on C(s), then we must have k(s) = cn(s) 

where c = —. Thus, k(s) and k(s) can respectively play the roles of Z(s) and 
7 

Z{s) in Theorem 5. So, according to Theorem 5, we see that k(s) and k(s) 
must have the same number of points of local maximum. Let this number of 
points be L. If L = 1, we simply take this single point of local maximum as 
the seed point and proceed as described in Section 4.2. However, if L > 1, then 
we must solve an arc length correspondence problem embodied by Equation 3. 
This allows us to compute L scale and rotation invariant sample points around 
C and C. Now, suppose we need n sample points, and we found that n L. If 
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n < L , we simply discard L — n of the already computed points. But, if n > L, 
then we must compute more sample points. One way to do this is to look at 
the smallest interval on each curve, and then use the length of this interval to 
mark off more points on the curve, beginning from the largest interval to the 
smallest one, until a total of n sample points can be found on the curve. We 
describe an algorithm that encompasses our local curvature maxima sampling 
technique as Algorithm 3 below.S 

6 Theories and Methods for Achieving Invari¬ 
ance with Compact Shape Trees: Spatial Do¬ 
main 

In the first place, translation invariance is an innate characteristic of compact 
shape trees. Moreover, in both the spatial and frequency domains, compact 
Shape trees lend themselves to simple techniques for achieving scale and rotation 
invariance. Beginning with the spatial domain, we detail these techniques in this 
section. Let T Pt = (V Pi , fi,f 2 , ■■■, fm) be a compact shape tree. We consider first 
how to achieve rotation invariance for the vector tree, V Pi , noting that rotation 
does not affect the moduli of the vectors in V Pi , but definitely does affects their 
angles. In vector notation, suppose V Pi = {vi,V 2 , To achieve rotation 

invariance, we simply take the differences of the angles of consecutive vectors 
in V Pi . Formally, assuming each i>j is in polar form, making angle 9i with the 
positive 21 -axis, and having magnitude r,;, it should be obvious that the ordered 
set S a = {cq+i — on | i £ Z + ,i < n— l,a n = oq} is rotation invariant. For 
instance, when the underlying shape is rotated through angle (j), each vector, 
Vi, in V Pi also rotates through (j>. If one denotes the new angle of Vi, after the 
rotation, by cq = at + </>, and defines the set Sg, = {cq+i — cq | i £ Z + ,i < 
n — l,a n = ai}, one easily finds Sa = S a . It remains to discuss rotation 
invariance for the features fi, f 2 , • • •, fm in T Pi . From a previous discussion, 
we have seen that these features can either be vectors or scalars. Analysis 
for the case when an fi is a vector is similar to the analysis for vector trees 
above. We deal only with the case wherein rotation preserves the angle made by 
vector fi and the underlying shape boundary’s tangent direction at the sample 
point, pi, where fi is measured. It turns out this is the case for the vectorial 
features, such as the curvature vector and the tangent vector, we are interested 
in; the curvature vector will always make an angle of ±90 degrees with the 
tangent vector, while the tangent vector will of course always make an angle 
of zero degrees with itself. Under this restriction, it can be shown that when 
the underlying shape rotates, say, through the vectorial feature fi also rotates 
through (f>. Thus, we can again achieve rotation invariance for each vectorial fi 
by taking the consecutive differences of its angles, as was done for vector trees 
above. The case when a feature / is particularly easy to handle. We first bring 
to mind the point notation V Pi = {pi,p l ,p 2 , ...,p n ~ 1 }. With this, one sees that, 
where f Pi is the value of feature / at point pt, and f l is the value of / at p l , then 
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the ordered set Sf = {f Pi , f 1 , / 2 ,/ n_1 } is rotation invariant. We now turn 
to the topic of scale invariance in the spatial domain. Clearly, scaling does not 
affect the angles of the vectors in V Pi = {v\,V 2 , ...v n -i}, but does affects their 
moduli by scaling them. A well known strategy for achieving scale invariance in 
this kind of situation is to divide each member in the collection of these moduli 
by the maximum of the collection (Alajlan, Wang). But, we must ask about 
the features in T Pi . Although we restrict attention to only the curvature vector 
(and its magnitude), we still need a lemma to proceed: 


7 Theories and Methods for Achieving Invari¬ 
ance with Compact Shape Trees: Frequency 
Domain 


As mentioned in our introduction, compact shape trees lend themselves to 
analysis in the frequency domain. In particular, it is possible to extract a 
scale and rotation invariant version of the underlying vector tree, V Pi , in any 
compact shape tree. In what follows, we explicitly separate the ordered set 
V Pi = {vi,V2, ■ ■ ■ ,v n -i} into two ordered sets, = {e® 1 , e® 2 ,..., e 0n ~ 1 }, and 
r = {Zi, I 2 , ■ ■ ■, where is the angle the vector Vi € V Pi makes with 

the positive x-axis, and f is the length of 'ty. We may consider the aperiodic 
discrete Fourier transform, Xg(ui), of the sequence of exponentiated angles in 
<P, and write: 

n— 1 

X e (w) = '52e 0t e- iui ( 11 ) 

i =1 

where j = y/—l Analogously, we may denote the aperiodic discrete Fourier 
transform of the sequence of moduli in set r by Xi(ui), and write: 

n— 1 

X l {cj) = J2he- jui ( 12 ) 

*=1 


At this juncture, bring to mind the physical fact that rotation affects only the 
angles in a vector tree; as far as rotation is concerned, we need only bother about 
the ordered set F = {e Bl , e 6 * 2 ,..., e 61 ”- 1 }. Using Equation 11, we can state and 
prove a theorem about the rotation invariance of an arbitrary vector tree, V Pi : 


Theorem 6. Suppose S and S are two shapes such that S is the result of rotat¬ 
ing S through angle <fi in the plane; and let Xg(ui) and Xg(ui) be the aperiodic 
discrete Fourier transforms associated with S and S respectively. Then, for any 

Xe{oj\) Xq{uj 1 ) 


lo, we have Xg(ui) = e^Xgfjj) and for any u>i,u> 2 , we have 


which says that the quantity 

S. 


Xe(wi) 

Xg(uj 2 ) 


Xg[uJ2 ) Xg{fjJ2)' 


is a rotation invariant property of the shape 


18 



Proof. 

The sequence of exponentiated angles associated with the vector tree of shape S 
is in the ordered set = {e 91 , e 6 * 2 ,..., e 9n ~ 1 }. When S is rotated through </> de¬ 
grees, each Oi must increase by (j>. Hence, the ordered set of exponentiated angles 
associated with shape S is <P = {e^ +ei , e^ +S2 ,... ,e^ >+9n - 1 }. Now, the Fourier 
transform of this sequence is Xg(cu) = Y^i=i e‘^ + 6 ,i e _ ' 7a ” = e^e^e - - 7 "* = 

e^ EITi 1 e^e - - 7 "* = e^Xg{u>). From this, we see that Xg(cv) = e^Xg(u;), for any 

u). Thus, for a specific oq, we may write Xg(wi) — e^Xghwi) => Xg(w\) _ ^ 

Xg{uj i) 


Similarly, for another specific ui 2 , one finds 

0 _ Xg(uj 2 ) _ Xg(uJi) Xg(ui 1 ) 

Xg(uJ2) Xg(uj 1 ) Xg(u> 2 ) 


Xg(u> 2 ) 
Xg(u>2 ) 
XgM 
Xg(uJ2) 


e*. Consequently, one 

□ 


Remark 1. The above proof should reveal why we exponentiated the angles be¬ 
fore taking their Fourier transform. This is a deliberate arrangement aimed at 
achieving rotation invariance in the manner described by Theorem 6 . Fortu¬ 
nately, the function f( 6 ) = e 9 turns out to be a good choice for this purpose, 
because it is a one-to-one function. To see why we need a one-to-one function 
here, consider g( 0 ) = sin# which is not one-to-one, and notice that it dimin¬ 
ishes the discriminatory power of the sequence, 6 \, 62 , ■ ■ ■, 0 n , as a feature set; 
for instance, g( 6 ) = sin# maps both j and ^ to the same value! 

We turn to the issue of scale invariance. Scaling affects only the lengths of 
the vectors in a vector tree. So, as far as scaling is concerned, we need only 
worry about the ordered set, r = {h, I 2 , ■ ■ ■, In- 1 }- The following theorem tells 
us how to achieve scale invariance for a vector tree: 


Theorem 7. Suppose S and S are two shapes such that <S is the result of scaling 
S by a factor of 7 ; and let Xi (u>) and Xi (u>) be the aperiodic discrete Fourier 
transforms associated with S and S respectively. Then, for any to, we have 

Xi( Ul ) Xtfa) 


Xi(u>) = 7 Xi(oj) and for any u)i,u> 2 , we have 


that the quantity 


Xl{U!) 

Xi(w 2 ) 


Xi[w 2 ) Xi(oj 2 ) 


which says 


is a scale invariant property of the shape S. 


Proof. 

The sequence of lengths associated with the vector tree of shape S is in the 
ordered set r = {h,h, ■ ■ ■, l n - 1 }- When S is scaled by a factor of 7 , each l, 
must be scaled by 7 . Hence, the ordered set of lengths associated with shape 
S is r = {jlij'yh, ■ ■ • , 7 / 71-1 }• Now, the Fourier transform of this sequence is 
Xi(oj) = = 7I*"*" = 7 Xi{uj). From this, we see that 

Xi(ui) = 7 Xi(ui), for any ui. Thus, for a specific uq, we may write Xi(ui 1 ) = 


7E(wi) 


Xi( oq) 
Xi(u 1) 


= 7 . Similarly, for another specific uq, one finds 


Xl(L02) 

X t {U2) 


19 



7 - Consequently, one sees, 7 


□ 


= Xi( Ul ) ^ Xtfa) = Xfa 1 ) 

X 1 ( 0 : 2 ) Xi(oj 1 ) Xi(oj2) X 1 ( 0 : 2 ) 
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